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ABSTRACT 

The  problem  of  selection  of  the  best  multivariate  population 
is  given  a  new  formulation  which  does  not  involve  reducing  the 
populations  to  univariate  quantities.  This  formulation's 
solution  is  developed  for  known,  and  (using  the  Hetercscedastic. 
Method)  also  for  unknown,  variance-covariance  matrices.  Prefer¬ 
ence  reversals  and  arbitrary  nonlinear  preference  functions  are 
explicitly  allowed  in  this  new  theory. 


1.  INTRODUCTION 

Let  7T  be  a  multivariate  normal  population  with  p  (>_  1) 
variates,  mean  vector  and  positive  definite  variance- 
covariance  matrix  ^  (i  =  l,...,k);  i.e.  ,  let  flu  be  N  (y^, 

(i  =  l,...,k).  The  problem,  of  selecting  that  one  of  tt^  , . . .  ,-ny. 
which  is  "best"  in  some  precise  sense  is  an  important  one  which 
arises  frequently  in  practice  (e.g.  see  Chapter  15,  pp.  341-394, 
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of  Gibbons,  Olkin,  and  Sobel  Q977)).  However,  "The  whole  field 
is  as  yet  undeveloped  and  the  reader  [of  Gibbons,  Olkin,  and 
Sobel  (1977),  p.  390]  is  encouraged  to  regard  this  chapter 
[their  Chapter  15]  as  an  introduction  to  a  wide  area  that  will 
see  considerable  development  in  the  future  as  more  meaningful 
models  are  formulated."  This  current  paper  deals  with  the  devel¬ 
opment  of  such  a  model. 

Host  often,  studies  in  this  area  have  started  by  assuming 
given  a  known  function  4>  ‘  '{'(u,  f )  which  is  real-valued.  One 
then  defines  <j>,  =  ij>(y,,  C ,  £k),  defines  the 

population  associated  with  the  largest  [sometimes,  the  smallest] 
of  1^,.,.,$,  as  "best",  and  then  proceeds  to  develop  ranking  and 
selection  procedures  for  the  resulting  univariate  problem.  For 
example ,  one  often  then  seeks  to  develop  a  procedure  6  such  that , 
for  a  fixed  F*  (1/k  <  P*  <  1)  the  probability  of  a  correct 
selection  satisfies 

inf  P(CS|  *)  =  P* 

where  ftp  is  some  subset  of  the  parameter  space  ft  of  Up,...,^, 
,fk.  (Let  bj-p]  <.  .  ■<_  b[v]  denote  •  •  >4>k  in  numerical 
order.  Then  if  "CS"  means  selection  of  the  population  with  4>[k] 
and  ftp  is  a  proper  subset  of  ft,  we  call  ftp  the  preference  zone 
and  say  we  are  dealing  with  the  indifference-zone  setting 
pioneered  by  Bechhofer  (1954).  While  if  "CS"  means  selection  of 
a  subset  S  of  £ir^ ,. . .  ,7^}  such  that  S  contains  the  population 
with  and  ftp  =  ft,  we  say  we  are  dealing  with  the  subset- 

selection  setting  pioneered  by  Gupta  (1956).)  Various  authors 
have  made  various  choices  of  the  real -valued  function  <t>(y,  p. 
For  example,  <f>  =  y'  y  (Alam  and  Rizvi  (1965),  Gupta  (1966). 
Srivastava  and  Taneja  (1972)),  §  ~  y'  y  (Srivastava  and  Taneja 
(1972)),  <p  =  c1  y  for  a  vector  of  constants  c  (Krishnai.ah  and 
Rizvi  (1956)),  the  generalized  variance  (Gnanadesikan  and  Gupta 
(1970),  i<agier  (1976)),  the  multiple 
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correlation  coefficient  (Gupta  and  Panchapakesan  (1569),  Rizvi 
■and  Solomon  (1573),  Alam,  Rizvi  and  Solomon  (1976)),  the  product 
moment  correlation  for  the  bivariate  (p  =  2)  case  (Govindarajulu 
and  Gore  (1971)),  and  correlated  variances  (Arvesen  and  McCabe 
(1973)). 

While  in  studies  to  date,  as  detailed  above,  the  multi¬ 
variate  ranking  and  selection  problem  has  been  univariatized 
(often  for  mathematical  convenience  rather  than  for  compelling 
reasons  why  one  should  regard  the  4>(u?  used  as  a  natural 
measure  of  population  goodness ;  while  we  do  not  mean  to  say  or 
imply  that  the  considered  in  the  literal' are  are  always  un¬ 
reasonable  in.  practice,  we  do  believe  there  has  lieen  somewhat 
more  emphasis  on  them  than  their  applicability  warrants ,  due 
.largely  to  their  mathematical  tractability;  for  example,  we 
are  not  awai'e  of  a  development  of  Mahalanobis  distance  showing 
it  to  be  a  iv.tural  measure  of  distance),  in  the  present  paper  we 
attempt  to  give  a  first  multivariate  solution  of  the  multi¬ 
variate  ranking  and  selection  problem.  In  particular,  our 
treatment  allows  for  such  occurrences  as  (letting  "V  denote  "is 
better  than"  or  "is  preferreu  to") 

Trl  *  1T2  >1t3>  *1  (1.1) 

which  are  well-known  in  (e.g.)  sports  (tennis,  golf,  etc.). 

While  (1.1)  is  art  anomaly  in  previous  treatments  of  multivariate 
ranking  and  selection,  it  is  to  be  expected  in  truly  multi¬ 
variate  problems  (i.e. ,  problems  in  which  one  cannot  associate  a 
univariate  measure  of  goodness,  or  number,  <J>(yi,  £),  with  a  given 
population,  but  must  rather  compare  different  (y,  £)  pairs  them¬ 
selves  in  order  to  determine  which  is  preferred) ,  and  it  is  not 
anomalous  in  our  treatment  of  multivariate  ranking  and  selection. 
For  other  approaches  to  "anomalies"  such  as  (1.1)  (besides  the 
most  common  one,  of  denying  its  existence  and  blaming  it  on 
"random  variability")  see  Good  and  Tideman  (1976),  Fishbum 
(1977b),  Young  and  Levenglick.  (1978),  and  Lee  and  Dudewicz  (1980). 
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Note,  however,  that  the  approach  presented  below  is  also 
applicable  to  the  situations  (perhaps  more  common  than  those 
described  in  the  above  paragraph)  where  there  is  associated  with 
each  population  a  numerical  "goodness"  u^  =  u(u^),  1  <_  i  <_  k, 

such  that  iTj  is  preferred  to  tt^  iff  Uj  >  (1  <_  j ,  Z  <_  k).  In 

that  situation,  we  are  able  to  allow  not  simply  linear  functions 
u(p-),  but  arbitrarily  complex  reality,  including  e.g.  quadratic, 
polynomial,  exponential,  and  power  series  or  even  Fourier  series 
functions. 

Due  to  repeated  skepticism  of  the  true  existence  of 
situations  like  (1.1),  and  as  a  contrast  with  the  situation  in 
the  above  paragraph,  an  example  of  (1.1)  nay  be  useful  at  this 
point.  Namely,  in  the  Open  Tennis  Career  Records  as  of  11/14/78 
(compiled  by  Mr.  Steve  Flink  of  World  Tennis ,  and  graciously 
provided  by  Mr.  Ron  Bookman)  one  can  let  =  Martina 
Navratilova,  tt2  =  Virginia  Ruzici,  and  =  Regina  Marsikova. 

Then  tt^  *  (by  3-0),  itj  v  71 3  (by  4-1),  but  it  3  s»  tt^  (by  2-1). 

Any  reasonable  aDDroach  assigning  u(u • )  independent  of  examine- 
tion  of  Uj's  would,  in  this  setting,  come  to  erroneous  conclu¬ 
sions  as  to  the  relative  merits  of  tt-^  and  v, . 

Briefly,  in  our  new  theory  we  let  g(p-^ , . . .  ,yk)  be  an 
experimenter-specified  function  with  range  space  (1,  2,...,k} 
and  such  that 

g(y1,. . .  ,uk)  =  j  (1.2) 

iff,  given  a  choice  among  p^,...,pk>  the  experimenter  would  pre¬ 
fer  p j .  Below  we  give  nultivariate  £reference  selection  proce¬ 
dures  which  sample  from  each  of  m-p . . .  ,tt^,  estimate  by 
}r  (i  =  l,...,k),  and  select 

✓N  A 

^g^Hl’*  •  •  (1.3) 

This  is  done  first  for  the  problem  of  selecting  tt^  where 
b  =  g(p1,. .. ,uk)  when  ^  =...=  fk  =  c2I  with  a2  known  (Section 
2),  then  when  ^  =...=  |k  =  £  with  \  known  (Section  3),  next  for 
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the  case  of  known  but  unequal  (Section  4),  and  finally 

(using  the  previous  results  as  a  base  for  The  Heteroscedastic 
Method)  in  the  general  case  when  $]_»•••*!]<.  are  unknown  and  not 
necessarily  equal  (Section  5).  (While  sections  2,  3,  and  4 
could  be  combined,  we  have  kept  their  results  separated  for 
several  reasons.  First,  our  new  approach  to  multivariate  selec¬ 
tion  is  most  easily  understood  in  the  context  of  Section  2 , 
where  the  fewest  multivariate  complexities  are  present.  Second, 
the  setting  of  Section  3  with  $  = . . .  =  $  is  that  which  The  Hetero¬ 
scedastic  Method  builds  its  solution  on.  The  probability 
expressions  developed  in  Section  4  are  useful  in  Section  5.) 
Extensions  are  then  noted  ( Section  6 ) . 

We  wish  to  emphasize  that  the  nature  of  the  preference 
function  g  is  not  a  chief  point  of  interest  in  this  paper; 
however,  our  work  holds  for  any  and  e'  ary  such  function.  The 
specification  and/or  elicitation  of  s  oh  functions  is  part  of 
the  field  of  decision  theory.  Up  to  now  it  has  dealt  with  only 
relatively  "simple"  types  of  functions.  (For  some  illustrative 
papers  developing  nonadditive  (but  still  relatively,  simple) 
utility  theory,  see  Farquhar  (1976)  and  Fishbum  (1977a).  For 
some  typical  applications  of  this  utility  theory  in  practical 
situations,  see  Giauque  and  Peebles  (1976)  and  Krischer  (1976). 
For  some  consideration  of  assessment  of  multiattribute  cardinal 
utility  functions  see  Kirkwood  (1976).  For  research  papers  and 
a  survey  of  the  field  of  multiple  criteria/objective  decision 
making,  see  respectively  Starr  and  Zeleny  (1977)  and  Hwang  and 
Masud  (1979).)  We  plan  to  investigate  such  specifications  as 
(e.g.)  polynomial  functions  g.  The  present  paper  investigates 
how  one  can  do  statistical  inference  with  any  such  function 
(even  if  not  completely  elicited  or  known) ,  and  this  has  been 
hitherto  unavailable. 
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2.  SELECTION  OF  THE  BEST  WHEN  =  ...=  ^  =  o2I,  a2  KNOWN 


Let  be  Np  (y^ ,  ^)  for  i  =  and  assume  p  >_  1 

and  =  .,.=  =  o2I.  Here  I  is  the  p*p  identity  matrix  and 

and  a2  is  assumed  known.  Let  gC^, . . ,  ,uk)  be  an  experimenter- 
specified  function  with  range  space  {1,  where  (1.2) 

denotes  the  fact  that  among  — ,Uk  the  experimenter  prefers 
y j .  The  natural  selection  procedure  in  this  setting  is  to 
observe  n  independent  observations  from  each  of 
estimate  y^  by  the  usual  sample  mean  vector  in  (1  <_  i  <_  k),  and 
select 

A  A 

Trg(V|l»---»Mk)  ‘ 

2 

Call  this  selection  procedure b- (o  I). 

Of  course  if  the  true  means 

U  -  (y-^i '  ‘  ‘  ’Hk^ 

are  such  that  g(y)  =  j  while  g(y  +  e)  =  i  (£  i  j)  for  a  matrix 


(2.1) 


(2.2) 


of  smell  numbers,  then  the  probability  of  correct  selection  of 

<a2I), 

p<CS!«twp(o2l»  ,  (2.3) 

will  not  be  much  larger  than  1/k.  We  will  therefore  show  next 
how  to  specify  the  sample  size  n  per  population  so  that ,  for  a 
reasonable  preference  zone  ftp(d*)  (d*  >0)  and  a  fixed  P* 

(1/k  <  P*  <  1)  procedure  ^M,,p(a2j)  satisfies 


MVP' 

^4(cs|'mvp<°2i))  1 


(2.4) 


Let 


Pj  =  {y:  g(y)  =  j}  ,  j  =  1,... ,k. 


(2.5) 


and  note  that  P^,...,Pk  are  disjoint  preference  sets  whose  union 

is  I^P.  Define  the  distance  between  any  two  points  a  and  b  of 

kp  .  .  ~ 

as  the  usual  Euclidean  distance 


6 


<  in 


(2.6) 


kp 


dCa,  b)  =  (  z  (a.  -  b-)2)1/2  , 

:  ;  i 

and  denote  the  distance  from  y  to  the  boundary  of  P^^  by 


dg  (y)  =  inf{d(y,  b):  b  ij:  P  •  (2.8) 

We  now  set  our  probability  requirement  for  any  procedure  7  as 
P(CS|7)  >_  P*  whenever  dg(y)  d*  . 

(That  is,  we  desire  a  selection  procedure  6  which  has  probability 
at  least  P*  of  choosing  the  true  best  population  (event  "CS") 
whenever  the  mean  vector  u  is  at  least  distance  dsV  from  mean 

vectors  where  other  populations  are  best.  This  same  require¬ 
ment  is  used  in  Sections  3  and  4  below  also.)  Whenever 
dg  (y)  >_  d*  we  have 

P(CS|*MVP(°2I))  =  P(’5  £  ^(y)5 

>_  P[d(y ,  y)  <_  dg(y)] 

>_  P[d(y,  y)  <  d*3  (2.9) 

k  p  ^  o  ~ 

=  P[  E  Z  (y -  y- „)2  <  Cd*)2] 
i=l  £=1  1X 

k  p 

=  P[  E  E  ((y.0  -  y.„)/(o// rT))2  £  n(d‘;')2/c^ ] 
i=l  £=1  1X  1X 

=  P[Y  <  n(d*)2/o2]  (2.9a) 

where  Y  has  a  (central)  chi-square  distribution  with  kp  degrees 
of  freedom.  Hence  we  obtain 

THEOREM  1.  Selection  procedure  y^p^I)  satisfies  probability 
requirement  (2.8)  if  the  sample  size  n  per  population  satisfies 
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n  -  Xkp  (P*}  °2/(d,’<)2  (2.10) 

9 

where  is  the  value  a  central  chi-square  random  variable 

with  kp  degrees  of  freedom  fails  to  exceed  with  probability  P*. 

The  problem  of  how  a  practitioner  will  in  practice  choose 
P*  and  <5*  deserves  some  comment.  First,  choice  of  P*  is 
similar  to  choice  of  power  in  tests  of  hypotheses,  and  a 
common  desire  is  for  P*  =  0.95  or  some  similarly  high  value. 

The  choice  of  <5*  is  similar  to  the  specification  in  ANOVA  of  the 
alternative  (noncentrality  parameter)  at  which  one  wishes  the 
power  guarantee.  If  the  experimenter  specifies  the  minimum 
range  between  the  largest  of  y, , . ,  u0  y.  •  and  the 

smallest  of  y,  , .  ,  u-  y,  •  which  he  wishes  to  detect  as 

>  0  units,  then  whenever 


max(M1;i,  P2ji,...,  U2>i,  •  •  •  >^<1  >  A* 

we  have  (for  any  two  possible  y's,  say  a  and  b,  which  satisfy 


(2.11))  d(a,  b)  _>  A^/2.  Hence  (since  the  of  interest  may 

vary  from  component  to  component ) ,  one  may  choose  6*  = 
min(Ap...,  Ap)/2.  This  is  similar  to  the  ANOVA  situation 
as  considered  by,  e.g.,  Scheffe  (1959),  pp.  63-64. 

Regarding  use  of  the  usual  sample  mean  vector  ir  to 


estimate  y^  (1  <  i  <  k),  it  should  be  noted  that  this  is  perhaps 
not  optimal.  However,  recent  results  of  Draper  and  Van 
Nostrand  (1977)  suggest  an  optimal  estimator  will  not  be  far 
away  from  this  estimator. 

Note  that  use  of  d(y,  b)  specifies  the  part  of  where 

we  control  the  P(CS);  it  does  not  univariatize  our  preferences. 
Also,  g  as  a  function  of  the  y^'s  alone  is  most  usual  when  the  y_. 's 
relate  to  long-term  yield  (in  decision  theory);  extension  to 
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functions  of  pis  and  (when  }h's  are  unknown;  when  they 

are  known,  the  present  results  suffice)  is  a  desirable  area  for' 
further  study. 

3.  SELECTION  OF  THE  BEST  WHEN  ^  -...=  £  KNOWN 

Let  by  Np  (p^,  for  i  =  l,...,k,  and  assume  p  >_  1 
and  ^  = ■ ■ - ”  =  £.  Here  $  is  the  conrnon  pxp  variance- 

covariance  matrix  and  is  assumed  positive-definite  and  known. 
Let  ^  ^  1  “  •  1  denote  the  characteristic  roots  of  $ . 

Let  g(p, , . . . ,p^)  be  an  experimenter-specified  function  with 
range  space  {l,...,k}  as  in  Section  2,  and  consider  using  the 
same  selection  procedure  as  in  Section  2  (but  now  denote  it  by 
^Mvp(f)  to  explicitly  take  notice  of  the  different  variance- 
covariance  structure  being  assumed).  Still  taking  our 
probability  requirement  as  (2.8),  we  wish  to  determine  the 
common  sample  size  n  per  population  in  such  a  way  that  (2.8)  is 
guaranteed.  We  will  find  the  following  result  (Rao  (1965), 
p.  50)  useful. 

LEMMA  1.  Let  A  be  any  symmetric  pxp  matrix,  let 

91  2L  —  •  *—  denote  the  characteristic  roots  of  A,  and  let 

x  be  any  p*l  vector.  Then 


9-,x'x  <  x'  A  x  <  0,_x'x 
1 -  - p~  - 


(3.1) 


Now,  choosing  A  =  $-1  in  Lemma  1  (so  0^  is  the  smallest 
characteristic  root  of  i.e.  1/Ap)  we  are  in  a  position  to 

show  that  whenever  d^(p)  i  d*  we  have 
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modification  that  now  n^  observations  are  taken  from 

(1  1  i  £  k).  Still  taking  our  probability  requirement  as 
(2.SJ,  we  wish  to  determine  the  sample  sizes  in  such  a 

way  that  (2.8)  is  guaranteed. 

Let  A-  denote  the  maximum  characteristic  root  of  t-  (i.e.  , 
1P  . _1  x . 

the  smallest  characteristic  root  of  ^  ),  i  =  l,...,k.  Using 

results  developed  previously,  it  then  follows  that  whenever 

d  (u)  >  d*  we  have  (letting  n^j  =  min(n^, . .  .  jn^)  and  Aj-^jp  = 

max(Alp>--”Xkp)) 


P{CS^MVP(t1»---»tk)) 


>  P[  E  (y.  -  y.)'(u,  -  u,-)  <  (d*)2] 

»  *  •'*  J-  ~  J~  ~  -L  — 

1  =  1 

k  .  ^  Hr  -1  "I  O 

i  «  k  "ir-  cHi  -  «i>’  4i  -  «i>  .1  X“T  (dS)  ] 

1=1  ip  LKJp 


(4.1) 


-1 


i  PC  ",  (Mi  -  Mj)'  fi  <Mi  -  Mi>  i  "[U^W 


=  Pt 


where  Y.,,...^  are  inaependent  (central)  chi-square  random 
variables  with  p  degrees  of  freedom.  Hence  we  obtain 
THEOREi  3.  Selection  procedure  %yp(|q> •  •  •  j$k)  satisfies 
probability  requirement  (2.8)  if  the  sample  sizes  n-j  , . . .  are 
such  that 


nm  >  xlCP5-)  An,^/(d*)? 


‘[1]  -  Akpv 


Tk]p' 


(4.2) 


where  y}  (P*)  is  the  value  a  central  chi-square  random  variable 
Akp 


with  kp  degrees  of  freedom  fails  to  exceed  with  probability  P*, 
n[l]  ■’■s  ’t'ie  smaHest  °f  ni>' 


the  characteristic  roots  of  $]_>••->$]<;• 


,nk,  and  Ap^  is  the  largest  of 
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I 


Using  Theorem  3,  in  the  design  problem  one  would  of 
course  choose  equal  sample  sizes  n-  =  ^  =  ...=  equal  to  the 
smallest  integer  _>  the  right-hand  side  of  (4.?).  However,  if 
unequal  sample  sizes  have  already  been  taken,  one  can 

nevertheless  easily  assess  the  smallest  diV  for  which  (2.8)  is 
satisfied  by  solving  (4.2),  as  an  equality,  for  d*. 

As  an  exact  expression  for  the  first  bounding  expression 
in  (4.1),  we  have 


p(cs^MVp(fl,...^k))  >  PC  (^  -  l^)'  (yi  -  u±)  <  <d*)2] 


i=l 


=  P[S1  +.  ..+  ^  <  (d-:)2] 


(4.3) 


where  S, , . . . , are  independent  random  variables .  This  can  be 
used  to  achieve  (2.8)  with  a  smaller  total  sample  size 
nl  +  n2  +' ' -  +  nk  ^  t*ie  des^-Sn  Problem,  via  choosing  n^, . . .  ,nk 
so  that  (4 . 3 ) 1  s  right-hand  side  equals  P*  and  +  n?  +...+  n^ 
is  minimized.  Similarly,  in  the  situation  where  n^,..  .  have 
already  been  taken,  the  smallest  ds'!  for  which  (4.3)  equals  P* 
(hence  (2.8)  is  satisfied)  will  be  smaller  than  that  provided 
by  Theorem  3,  hence  yielding  an  .improved  (strengthened)  proba¬ 
bility  requirement.  However,  since  it  is  computationally  dif¬ 
ficult  (i.e.,  expensive  and  tine  consuming)  to  evaluate  (4.3), 
we  have  emphasized  a  simpler  result  (albeit  it  requires  a 
larger  sample  size  (in  the  design  setting)  and  yields  a  larger 
d*  for  (2.8)  (in  the  data  analysis  setting))  in  Theorem  3. 

Since  (4.3)  is  of  interest  in  Section  5,  we  note  that  it 
may  be  calculated  using  the  following  facts.  Since  we  may 
represent  S-  as  S-  =  Y- '  Y-  where  Y-~N  (0,  $./n-)  (1  <_  j  £  k), 

•j  «j  mj  JT  P 

it  follows  that 
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H,(t>  =  PCs.  <  t]  -  (2TT)-P/2|L/n-|-1/2  /.../e"  fej  (tynj  ^Xjdy 

32  y^y-  <  t  ~3 


V-l. 


=  (2tt)-P/2  /.../  e4"'*dw 


(4.4) 


^  Wnj  i  1 

(where  we  utilized  an  orthogonal  transformation),  and  (from 
Pacharcs  (19S5))  that 


(n^t/2)p/2 


Ityn-I1^  ft  r(?P  +  s  ♦  l) 


z 


(-njt/2)Vp/21 


Z 


I1-L+  —  + 


*T> 


ra14)...r(i^4) 


(4.5) 


It  , 

i1 ! . . .i  ! X -1. . . X .p^ 

1  P  31  2p 


5.  SELECTION  OF  THE  BEST  WHEN  ^ , . . . ,£k  UNKNOWN ,  UNEQUAL 

Let  be  N  (p^,  |^)  for  i  =  l,...,k,  and  assume  p  >  1. 

Here  the  k  positive-definite  p*p  variance-covariance  matrices 
•»!)<.  are  assumed  unknown,  and  need  not  be  equal.  Let 
g(y^  > . • ■ ,yk)  be  an  experimenter-specified  function  with  range 
space  {l,...,k}  as  in  Section  2,  and  consider  the  same  selection 
problem  as  in  Section  2.  It  follows  from  Dudewicz  (1971)  and 
Dudewicz  and  Dalai  (1975)  that  no  single-stage  procedure 0  for 
this  problem  can  satisfy  a  probability  requirement  like  (2.8), 
namely 

P(CS|£)  >_  P*  whenever  dg(p)  i_d;';.  (5.1) 
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However,  since  (5.1)  is  the  type  of  probability  requirement  one 
would  usually  desire  in  practice,  it  is  desirable  to  seek  a  8 
satisfying  (5.1"'  in  a  broader  class  than  the  class  of  single- 
stage  procedures. 

Dudewicz  and  Bishop  (1979)  have  given  a  method,  called 
The  Heteroscedastic  Method,  which  allows  us  to  modify  procedure 
^MVP'<P  of  Section  3  into  a  procedure  which  solves  the 
present  problem.  Namely,  let  procedure  j pp  he  specified  by  a 
sampling  rule  and  a  terminal  decision  role  as  follows. 

Sampling  Rule  forP^.  Select  z  >  0,  and  integer  Uq  >  p, 


and  a  pxp  positive-definite  matrix  (a  ).  Take  observations 


from  each  and  every  population  ttc  (c  =  l,...,k)  as  follows. 


Take  n.-.  initial  observations  X  , .  . .  ,X  where  X.  •  = 


(Xcli’  Xc2.i’  * '  •  ,Xcp  (  l,2,...,nQ)  and  compute 

,  n°  no 

7=i  =  n\  £  XoU>  soij  -  <5-2> 


scij  tn0-l)  3cij  ’  i  . 


Define  the  positive  integer  N  by 


Nc  =  +  p2,  [z_1_  I  a—  sci.]  +  1}, 


(5.3) 


i,j=l 

where  [q]  denotes  the  largest  integer  less  than  q,  and  select 


P  (p*Nc)  matrices 


acr 

A  =  11 

*  *  • 

C"1N 

< 

cr 

a 

.  4 .  a 

cr  - 
pi 

cr  ,T 
pN 

c  (r  =  1,2,. . . ,p) 


in  such  a  way  that: 


1) 

2) 


and 

3) 


cr 


il 


= . . .  =  a 


cr . 
oxi 


0 


Acr  ric  =  er  where  nc  is  the  Ncxl  vector  (1,1,...,  1)'  and 
is  the  p*l  vector  whose  r^h  element  is  1  and  all  other 
elements  are  zero; 


AqA'c  =  z(arS)  t  (sJ;J),  where  Ac  =  <Acl>  Ac2  ’,*,,Acp)’® 
denotes  the  direct  product ,  and  (b1^ )  denotes  the  inverse 


of  the  matrix  (b. .),  r,i  =  l,2,...,p. 

J 

Next  take  Nc  -  n^  additional  observations  Xc  n  .^...jX^ 


and 


compute 


p  Nc 

7-1  1  a_„  X  •  0 

cr  i=l  £*l  cri£  Cl£ 


(r=l,2,. . . ,p). 


(5.4) 


For  ttc  construct  the  p-dimensional  vector  XQ  =  (Xcl, . . . »Xcp) » 
c=l,2 , . . . ,k. 

Terminal  Decision  Rule  for#^.  Take  the  same  decision 

as  rule^^p  (f)  of  Section  2  would  when  it  took  a  sample  size 
n  per  population,  had  £/n  =  z(cks),  and  observed 

(i,,...,ll  )  =  (X,  ,...,xv )  .  (5.5) 


That  is,  select 

•rrg(X15...,Xk)  .  (5.6) 

Now  from  (a  slight  extension  of)  Theorem  (2.20)  of 
Dudewicz  and  Bishop  (1979)  it  follows  that  I  z>0  such  that  we  have 
THEOREM  4.  Selection  procedure  satisfies  probability- 
requirement  (5,1).  The  constant  z  >  0  in  is  to  be  chosen 
so  that 

k  ~  ~  n 

PL  E  i7.  -  q.)'  (X,  -  y.)  <  (d*)  1  =  P*  •  (5.7) 

i=l  1  ~ 

While  the  distribution  of  (5.5)  is  independent  of 
(£p,...,$k),  it  is  very  complicated  (see  equations  (2.12), 

(2.13)  of  Dudewicz  and  Bishop  (1979)),  hence  calculation  of 
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z  >  0  which  satisfies  (5.7)  is  rot  a  simple  matter.  However, 
for  large  (a  design  constant  under  the  experimenter's  control) 
we  may  approximate  ( 5 . 7 ) ' s  solution : 

THEOREM  5 .  As  n^  +  ®  the  z  >  0  which  solves  (5.7)  approaches 
the  solution  of  (5.7)  when  is  replaced  by  (Y-^ , . . . 

where  Y^,...^^  «re  independent  random  variables  and  is 
Np(y^ ,  zp(a  )).  This  solution  may  be  calculated  from  (4.3) 
and  (4.5). 

PROOF .  This  follows  from  the  proof  of  Theorem  (5.1)  of 
Dudewicz  and  Bishop  (1979),  where  we  find  that  the  limiting  (as 
nQ  «)  distribution  of  (X-^ ,  -  - .  »X^_)  is  the  same  as  the 
distribution  of  (Y^,, . . ,Y^) . 

6 .  EXTENSIONS 

While  in  this  paper  we  have  developed  a  new  approach  for 
one  goal  of  the  area  known  (see  Gupta  and  Panchapakesan  (1979), 
p.  7)  as  "ranking  and  selection",  namely  "selection  of  the 
best,"  problems  such  as  the  selection  of  the  best  t  (of  k) 
multivariate  populations,  as  well  as  selection  of  a  subset 
containing  the  best  multivariate  population,  can  also  be  given 
solutions  using  the  methods  of  this  paper.  Details  will 
appear  elsewhere. 
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